Improvement of speaker recognition system by individual information weighting
نویسندگان
چکیده
In speaker recognition, it is very important to use individual information extracted from speech waves. Most of the speaker recognition methods assume that each part of speech has equal amount of information to represent a speaker, although it di erently contribute to speaker recognition. The aim of this paper is to suggest a new scoring method of the HMM, which applies di erent importance to all the basic portions of a sampled speech waveform. we rst de ne the quantity of the importance of speech frames, propose how to measure it and apply to speaker recognition. The performance of the proposed method was compared to non-weighting HMM based speaker recognition system. In speaker veri cation experiments, the proposed method reduced equal error rates considerably as compared to a conventional method which treats all speech segments to have the same importance. In speaker identi cation experiments, the proposed method marked relatively 28% higher recognition rate than the baseline system, and was more robust in long-term variation. These results demonstrate that the proposed method is e cient in measuring speaker information and more appropriate for speaker recognition.
منابع مشابه
Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملAudiovisual perception in adverse conditions: Language, speaker and listener effects
This study investigated the relative contribution of auditory and visual information to speech perception by looking at the effect of visual and auditory degradation on the weighting given to visual cues for native and non-native speakers. Multiple iterations of /ba/, /da/ and /ga/ by five Australian English and five Mandarin Chinese speakers were presented to Australian English, British Englis...
متن کاملNAP, WCCN, a New Linear Kernel, and Keyword Weighting for the HMM Supervector Speaker Recognition System
We demonstrate the application of Nuisance Attribute Projection (NAP), Within-Class Covariance Normalization (WCCN), a new standard kernel, and keyword weighting for the keywordbased HMM supervector speaker recognition system. On our development set (SRE04 8-side training), we achieve 22.6% and 16.2% EER improvements using NAP and WCCN respectively, a 19.5% EER improvement using NAP and WCCN jo...
متن کاملThe use of speaker correlation information for automatic speech recognition
This dissertation addresses the independence of observations assumption which is typically made by today’s automatic speech recognition systems. This assumption ignores within-speaker correlations which are known to exist. The assumption clearly damages the recognition ability of standard speaker independent systems, as can seen by the severe drop in performance exhibited by systems between the...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000